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cDNA CLONE FOR SOUTH AFRICAN ARBOVIRUS NO. 86 

Field of the Invention 

The present invention relates to live attenuated vaccines in general, 
and particularly relates to attenuated vaccines produced from South African 
Arbovirus No. 86 (S.A.AR86) vims. 

5 Background of the Invention 

This invention was made with government support under Grant No. 
2-ROIA122186 (09-13) awarded by the National Institutes of Health. The 
government has certain rights in the invention. 

South African Arbovirus No. 86 (S. A. AR86) is an isolate of Sindbis 

10 virus. S.A.AR 86 was originally isolated from mosquitoes. The virus is 
antigenically related to Sindbis virus and to two other Sindbis virus isolates, 
Girdwood S.A. and Ockelbo82. See, Malherbe et ah, South African Medical 
Journal, 37:547 (1963) and Niklasson et al., Am. J. Trop. Med. Hyg. 33:1212 
(1984), respectively. The latter is associated with a human disease, also known 

15 as Ockelbo. Sindbis virus is the prototype member of the alphavirus genus of the 
family Togaviridae. The Sindbis virus includes various strains, including 
S.A.AR86 and Sindbis AR339. The genome of Sindbis viruses consists of a single 
strand of RNA which contains the information for the viral genes, and which is 
infectious when introduced into the cytoplasm of cells. 

20 Full-length cDNA clones of positive-strand RNA viruses are 

important tools for the study of the biology of viruses including Sindbis viruses. 
It is known with respect to several viral systems that in vitro transcripts of cDNA 
clones, and in some cases the cDNA itself, can initiate a complete and productive 
infectious cycle upon introduction into susceptible cells. See Racaniello et al., 

25 Science 214:916 (1981); Ahlquist et al., Proc. Natl. Acad. Sci. USA 81:7066 
(1984); Kaplan et al., Proc. Natl. Acad. Sci. USA 82:8424 (1985); Mizutani et al., 
/. Virol. 56:628 (1985); van der Werf, Proc. Natl. Acad. Sci. USA 83:2330 
(1986); Rice et al., /. Virol. 61:3809 (1987); and Vos et al., Virology 165:33 
(1988). This has made it possible to test progeny virus for phenotypic 
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manifestations of directed mutations and recombinations which have been 
introduced into the cDNA clone. Pathogenesis studies with several positive-strand 
viruses, including the picornaviruses and the alphaviruses have been advanced 
significantly by the use of full-length cDNA clones. 
5 As another useful application, live attenuated viral vaccines may be 

produced using full-length cDNA clones. Live attenuated viral vaccines are among 
the most successful means of controlling viral disease. However for some virus 
pathogens, immunization with a live virus strain may be either impractical or 
unsafe. Sindbis virus is subclinical in humans, but is closely related to other 

10 viruses which do induce clinical diseases in humans, such as Ockelbo, an epidemic 
polyarthritis syndrome common in areas of Scandinavia and Northern Europe. 
Accordingly, Sindbis virus vaccines are desirable for producing an immunogenic 
response to such clinical diseases. Sindbis virus vaccines are also desirable as 
viral carriers in virus constructs which express genes encoding immunizing 

15 antigens for other viruses. See U.S. Patent No. 5,217,879 to Huang et al. Huang 
et al. describes Sindbis infectious viral vectors. However, the reference does not 
describe the cDNA sequence of S.A.AR86 virus, or clones or viral vectors 
produced therefrom. 

Accordingly, there remains a need in the art for full-length cDNA 

20 clones of positive-strand RNA viruses, such as the S.A.AR86 strain of Sindbis. 
In addition, there is a need in the art for full-length cDNA clones of S.A.AR86 
encoding infectious RNA transcripts. Further, there remains a need in the art for 
cDNA clones of S.A.AR86 which encode RNA transcripts which may be used to 
produce infectious attenuated viral particles, and methods of producing such viral 

25 particles. 

Summary of the Invention 

As a first aspect, the present invention provides a recombinant 
DNA comprising a cDNA coding for an infectious South African Arbovirus No. 
86 (S.A.AR86) virus RNA transcript and a heterologous promoter positioned 
30 upstream from the cDNA and operatively associated therewith. The cDNA is 
selected from the group consisting of (0 cDNA having the sequence given herein 
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as SEQ ID NO.: 1, (it) cDNA having the same RNA coding sequence as the 
cDNA given herein as SEQ ID NO.: 1, and (Hi) cDNA according to (i) or (/i) 
above and further containing at least one attenuating mutation. Preferably at least 
one attenuating mutation is included in the cDNA, and more preferably at least 
5 two attenuating mutations are included in the cDNA. Attenuating mutations may, 
for example, be provided in any of the nsPl, E2, and nsP2 coding regions of the 
cDNA. Preferably at least one silent mutation is included in the cDNA in addition 
to the attenuating mutation(s). 

As a second aspect, the present invention provides an infectious 
10 RNA transcript encoded by the cDNA. 

As a third aspect, the present invention provides infectious 
attenuated viral particles containing the RNA transcript encoded by the cDNA. 

The foregoing and other aspects of the present invention are 
explained in detail in the detailed description set forth below. 



15 Brief Description of the Drawings 

Figure 1 shows the relationship of the 3' half of the nsP3 gene 
among various Sindbis-like isolates. 

Figure 2 shows the replacement of of the AR339-derived cDNAs 
into the plasmid pTotollOl background. 

20 Detailed Description of the Invention 

The South African Arbovirus No. 86 (S.A.AR86) viral clones 
employed in practicing the present invention are genomic clones which code for 
an RNA transcript, which RNA trasncript is capable of producing live 
encapsidated S.A.AR86 virus when used to transfect a S.A.AR86 virus-permissive 
25 cell. 

S.A. AR86 virus-permissive cells are cells which, upon transfection 
with the viral RNA transcript, are capable of producing viral particles. The 
S.A.AR86 virus has a broad host cell range. Examples of suitable host cells 
include, but are not limited to Vero cells, baby hamster kidney (BHK-21) cells, 
30 and chicken embryo fibroblast cells. Uptake of the RNA into the cells can be 
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achieved by any suitable means, such as for example, by treating the cells with 
DEAE-dextran, treating the cells with "UPOFECITN™", and by electrophoresis, 
with electrophoresis being the currently preferred means of achieving RNA uptake 
into the host cells. 

5 The phrases "attenuating mutation" and "attenuating amino acid," 

as used herein, mean a nucleotide mutation or an amino acid coded for in view of 
such a mutation which result in a decreased probability of causing disease in its 
host (i.e., a loss of virulence), in accordance with standard terminology in the art. 
See, e.g., B. Davis, et al., Microbiology 132 (3d ed. 1980), whether the mutation 

10 be a substitution mutation or an in-frame deletion mutation. The phrase 
"attenuating mutation" excludes mutations which would be lethal to the virus. 

The phrase "silent mutation" as used herein refers to mutations in 
the cDNA coding sequence which do not produce mutations in the corresponding 
protein sequence translated therefrom. 

15 The cDNA clone has a sequence as given herein as SEQ ID NO.: 

1. Alternatively, the cDNA clone may have a sequence which differs from the 
cDNA of SEQ ID NO.: 1, but which has the same RNA coding sequence as the 
cDNA given herein as SEQ ID NO.: 1. Thus, the cDNA clone may include one 
or more silent mutations. For example, the clone sequence may differ from the 

20 wild-type S.A.AR86 sequence given herein as SEQ ID NO.: 1, by the inclusion 
of silent mutations at any or all of nucleotides 215, 3863, 5984, and 9113. The 
silent mutations at the foregoing nucleotides may be substitution or in-frame 
deletion mutations, such as for example, the substitution of guanine for adenine at 
nucleotide 215 of the cDNA sequence given herein as SEQ ID NO.: 1; or the 

25 substitution of guanine for cytosine at nucleotide 3863 of the cDNA sequence 
given herein as SEQ ID NO.: 1; or the substitution of guanine for adenine at 
nucleotide 5984 of the cDNA sequence given herein as SEQ ID NO.: 1; or the 
substitution of cytosine for thymine at nucleotide 9113 of the cDNA sequence 
given herein as SEQ ID NO. : 1 . In yet another emobidment, the cDNA clone has 

30 a sequence according to either of the foregoing described sequences, but which 
also includes attenuating mutations. The attenuating mutations being described 
more fully hereinafter. 
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Promoter sequences and S.A.AR86 virus cDNA clones are 
operatively associated in the present invention such that the promoter causes the 
cDNA clone to be transcribed in the presence of an RNA polymerase which binds 
to the promoter. The promoter is positioned on the 5' end (with respect to the 
5 virion RNA sequence), or "upstream" from, the cDNA clone. An excessive 
number of nucleotides between the promoter sequence and the cDNA clone will 
result in the inoperability of the construct. Hence, the number of nucleotides 
between the promoter seuqence and the cDNA clone is preferably not more than 
eight, more preferably not more than five, still more preferably not more than 

10 three, and most preferably not more than one. Examples of promoters which are 
useful in the cDNA sequences of the present invention include, but are not limited 
to T3 promoters, T7 promoters, and SP6 promoters. The DNA sequence of the 
present invention may reside in any suitable transcription vector. The DNA 
sequence preferably has a complementary DNA sequence bonded thereto so that 

15 the double-stranded sequence will serve as an active template for RNA 
polymerase. The transcription vector preferably comprises a plasmid. When the 
DNA sequence comprises a plasmid, it is preferred that a unique restriction site 
be provided 3' (with respect to the virion RNA sequence) to (i.e., "downstream" 
from) the cDNA clone. This provides a means for linearizing the DNA sequence 

20 to allow the transcription of genome-length RNA in vitro. 

The cDNA clone can be generated by any of a variety of suitable 
methods known to those skilled in the art. A preferred method is the method set 
forth in U.S. Patent No. 5,185,440 to Davis et aL, and Gubler et al., Gene 25:263 
(1983), the disclosures of which are incorporated herein by reference in their 

25 entirety. Attenuating mutations of S.A.AR86 are identified by sequencing 
attenuated strains of the S.A.AR86 virus and comparing the sequence of the 
attenuated strain with the sequence of the corresponding wild-type virus. Serial 
passage techniques for the generation of attenuated strains may be carried out in 
accordance with known procedures. Preferably, the atenuated strains are generated 

30 by selecting strains at each passage during serial passage in cell culture which 
either grow rapidly or penetrate the cell more rapidly. This selection process, 
which reduces the number of serial passages required to obtain attenuated strains, 
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is known. See, e.g., Olmstead et al., Science 225:424 (1984); and Johnston et al., 
Virology 162:437 (1988). the disclosures of which are incorporated herein by 
reference in their entirety. cDNA clones may be modified to incorporate 
attenuating mutations by site-directed mutagenesis in accordance with known 
5 procedures. An exemplary technique is that of Kunkel Proc. Natl. Acad. Sci. 
(USA) 82:488 (1985). These same techniques may be used to join the 
heterologous promoter to the cDNA clone. 

RNA is preferably synthesized from the DNA sequence in vitro 
using purified RNA polymerase in the presence of ribonucleotide triphosphates in 

10 accordance with conventional techniques. 

Pharmaceutical compositions, such as vaccines, containing the 
S.A.AR86 clone of the present invention comprise an immunogenic amount of a 
live attenutated virus as disclosed herein in combination with a pharmaceutical^ 
acceptable carrier. An "effective immunogenic amount" is an amount of the 

15 attenuated virus sufficient to evoke an immune response in the subject to which the 
vaccine is administered. An amount of about 10 1 to about 10 s plaque forming 
units per dose is believed to be suitable, depending upon the age and species of the 
subject being treated. Examples of suitable pharmaceutically acceptable carriers 
include, but are not limited to, sterile pyrogen-free water and sterile pyrogen-free 

20 physiological saline solution. Subjects which may be administered immunogenic 
amounts of the live attenuated viruses of the present invention include both human 
and animal (e.g., horse, donkey, mouse, hamster, or monkey) subjects. 
Administration may be by a suitable means, such as intraperitoneoal, intracerebral 
or intramuscular injection. 

25 Complimentary DNA clones of the S.A.AR86 virus are made in 

accordance with the procedures described herein, as supplemented with procedures 
known in the art. We employed as a starting material, the S.A.AR86 virus. 

A first exemplary attenuating substitution mutation in a S.A.AR86 
clone useful in practicing the present invention is a substitution mutation which 

30 codes for an attenuating amino acid, preferably isoleucine, at nsPl amino acid 
residue 538. 
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A second exemplary attenuating subsitution mutation in a S. A. AR86 
clone useful in practicing the present invention is a substitution mutation which 
codes for an attenuating amino acid, preferably threonine, at E2 amino acid 
residue 304. 

5 A third exemplary attenuating subsitution mutation in a S.A.AR86 

clone useful in practicing the present invention is a substitution mutation which 
codes for an attenuating amino acid, preferably lysine, at E2 amino acid residue 
314. 

A fourth exemplary attenuating subsitution mutation in a S. A. AR86 
10 clone useful in practicing the present invention is a substitution mutation which 
codes for an attenuating amino acid, preferably valine, at E2 amino acid residue 
372. 

A fifth exemplary attenuating substitution mutation in a S. A. AR86 
clone useful in practicing the present invention is a substitution mutation which 
15 codes for an attenuating amino acid, preferably alanine, at E2 amino acid residue 
376. 

One embodiment of the present invention contains, in combination, 
said attenuating substitution mutations at E2 amino acid residues 304, 314, 372, 
and 376. 

20 A sixth exemplary attenuating subsitution mutation in a S.A.AR86 

clone useful in practicing the present invention is a substitution mutation which 
codes for an attenuating amino acid, preferably glycine, at nsP2 amino acid residue 
96. 

A seventh exemplary attenuating subsitution mutation in a S. A. AR86 
25 clone useful in practicing the present invention is a substitution mutation which 
codes for an attenuating amino acid, preferably valine, at nsP2 amino acid residue 
372. 

One embodiment of the present invention contains, in combination, 
said attenuating substitution mutations at nsP2 amino acid residues 96 and 372. 
30 An eighth exemplary attenuating substitution mutation in an 

S.A.AR86 clone useful in practicing the present invention is a substitution 
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mutation which codes for an attenuating amino acid, preferably leucine, at nsP2 
amino acid residue 529. 

A nineth exemplary attenuating substitution mutation in an 
S.A.AR86 clone useful in practicing the present invention is a substitution 
5 mutation which codes for an attenuating amino acid, preferably asparagine, at nsP2 
amino acid residue 571. • 

A tenth exemplary attenuating substitution mutation in an S.A.AR86 
clone useful in practicing the present invention is a substitution mutation which 
codes for an attenuating amino acid, preferably arginihe, at nsP2 amino acid 
10 residue 682. 

An eleventh exemplary attenuating substitution mutation in an 
S.A.AR86 clone useful in practicing the present invention is a substitution 
mutation which codes for an attenuating amino acid, preferably arginine, at nsP2 
amino acid residue 804. 

15 A twelveth exemplary attenuating substitution mutation in an 

S.A.AR86 clone useful in practicing the present invention is a substitution 
mutation which codes for an attenuating amino acid, preferably arginine, at nsP3 
amino acid residue 22. 

One embodiment of the present invention contains, in combination, 

20 said attenuating substitution mutations at nsP2 amino acid residues 529, 571, 682, 
and 804, and at nsP3 amino acid residue 22. 

The cDNA clones according to the present invention are useful for 
the preparation of pharmaceutical formulations, such as vaccines, as described 
above. In addition, the cDNA clones of the present invention are useful for 

25 administration to animals for the purpose of producing antibodies to the S.A.AR86 
virus, which antibodies may be collected and used in known diagnostic techniques 
for the detection of S.A.AR86 virus. 

The following examples are provided to illustrate the present 
invention, and should not be construed as limiting thereof. In these examples, nt 

30 means nucleotide. 
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EXAMPLE 1 

Relationship of S. A. AR86 Clone to Other Sindbis Strains 

At the nucleotide level, S.A.AR86 differs from the consensus 
sequence of Sindbis strain AR339 by 685 nucleotides (1 1 1 amino acids). From the 
5 published Sindbis HR sequence (See, Strauss et al., Virol 133:92 (1984)), 
S.A.AR86 differs by 704 nucleotides and 119 amino acids. S.A.AR86 differs 
from the sequence of Ockelbo82 (See, Shirako et al., Virol 182:753 (1991)) by 
430 nucleotides (67 amino acids). Included in these differences are several 
insertions and deletions present in S.A. AR86 relative to the Sindbis sequences The 
10 relationship of the 3' half of the nsP3 gene among various Sindbis-like isolates is 
shown in Figure 1. 

EXAMPLE 2 
Observed Mortality in Mice Infected with S.A.AR86 
When adult mice are inoculated intracerebrally (i.e.) with low doses 
15 of S.A.AR86, 100% motality is observed. The methods employed for evaluating 
the instance mortality in mice are set forth in D. Russell, et al., 7. Virol 
63(4): 1619 (1989), the disclosure of which is incorporated herein by reference in 
its entirety. This high mortality rate is unique among "Sindbis-like" viruses which 
are characteristically avirulent in adult mice. Also unique in the S. A.AR86 strain 
20 is the cys substitution for the opal stop codon normally found between the nsP3 
and nsP4 genes. 

EXAMPLE 3 
Construction of S.A.AR86 Clone 

The S.A.AR86 clone is constructed by substituting partial cDNA 
25 clones of S.A.AR86 genomic RNA into pTR5000, one of a series of Sindbis 
AR339 cDNA clones. Construction of pTR5000 (a full-length cDNA clone of 
Sindbis following the SP6 phage promoter and containing mostly Sindbis AR339 
sequences) is accomplished by sequential replacement of AR339-derived cDNAs 
into the plasmid pTotollOl background, according to the technique described in 
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Rice et al., 7. Virol 61:3809 (1987). The replacement of of the AR339-derived 
cDNAs into the plasmid pTotollOl background is shown in Figure 2. 

Production of the cDNAs used in constructing pTR5000 has been 
described previously in Polo et al., 7. Virol 62:2124 (1988), as has the 
5 construction of pTR2000, see Polo et al. , 7. Virol 62:2124 (1988) and Polo et al. , 
7. Virol 64:4438 (1990). Nucleotide numbering follows that of Strauss et al. 
Virol 133:92 (1984). 

The StuI (nt8571) to SacII (ntll484) fragment of pTotollOl is 
removed and replaced with the analogous fragment from clone pSB4 to form 

10 pTR2000, using the loss of the pTotollOl StuI site at ntl0770 (a site not present 
in AR339) as a screen, as shown in Figure 2. The sequences of AR339 and 
pTotollOl are identical from ntll485 to the 3'-end (ntll703). Therefore, as 
shown in Figure 2, these 3' sequences are of AR339 origin. Construction of 
pTR3000 is accomplished by replacement of the BssHII (nt9804) to StuI (nt8571) 

15 fragment of pSB3 into pTR2000 from which the analogous fragment had been 
removed. The AfUI site found at nt8835 in pTotollOl but absent in AR339 is 
used to screen the recombinants. An AR339 fragment from pSBl, Spel (nt5262) 
to BssHII, is used to replace the Spel-BssHII fragment from pTR2000, using the 
Aflll screen and forming pTR4000. To construct pTR5000, pSB5 is subcloned 

20 into pUC119, and the PstI site at nt3953 is ablated using site-directed mutagenesis, 
as described in Kunkel, Methods Enzymol 154:367 (1987), to change nt3950 
from U to C. The Clal (nt2713) to Spel fragment is removed from the 
mutagenized subclone and for pTR5000, is used to replace the analogous fragment 
of pTR4000 using the ablated PstI site as a screen. 

25 Partial cDNA clones of S.A.AR86 were obtained using classical 

reverse transcriptase (RT) procedures according to Polo et al., 7. Virol 62:2124 
(1988), as well as RT-PCR protocols according to Heidner et al., 7. Virol 
68:2683 (1994). These cDNA clones are used to replace analogous portions of the 
clone pTR5000, culminating in the construction of a full-length cDNA of the 

30 S.A.AR86 genomic sequence downstream of an SP6 promoter and followed by a 
poly (A) tract and a unique Xbal site. During the course of replacing S.A.AR86 
sequences into pTR5000, it was observed that the pTRSOOO nonstructural proteins 
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are incompatible with those of S.A.AR86, so that the chimeric clones yielded 
transcripts which are not infectious for baby hamster kidney (BHK-21) cells. In 
addition, the restriction sites in the nonstructural region of S.A.AR86 are very 
different from the Sindbis AR339-based clones. The first complete S.A.AR86 
5 clone, pSIO, also failed to yield infectious transcripts. 

The construction is repeated, beginning with pTR5000, using 
sequences derived from the same partial cDNA clones of S.A.AR86 as are used 
for the construction of pSIO except for nucleotides 3171 to 6410 (numbering from 
nucleotide 1 of the S.A.AR86 sequence), which are derived by RT-PCR of the 
10 genomic RNA. The resulting construct, pS22, gives infectious transcripts, but the 
virus derived therefrom is temperature-sensitive. Replacement of nucleotides 3171 
to 6410 with an alternative RT-PCR derived cDNA corrected the temperature- 
sensitive defect, yielding clone pS24. 

EXAMPLE 4 

15 Observed Mortality in Mice Infected with S.A.AR86 

Upon i.e. inoculation virus derived from pS24 is avirulent, whereas 
S.A.AR86 caused 100% mortality. Clearly, pS24 contained one or more 
mutations which were strongly attenuating. The complete sequence of pS24 was 
determined directly from pS24 and related clones. Comparison with the 

20 S.A.AR86 genomic RNA sequence reveals 5 mutations or clusters of mutations 
which are potentially associated with the avirulent phenotype of virus from pS24. 
These included the mutations or clusters of mutations indicated in clones pS56, 
pS51, and pS57, a mutation at ntl278 A-C (nsPl 407 K-Q), and a mutation at 
nt5972 T-G (nsP3 228 N-S). While said nsPl 407 substitution alone is a lethal 

25 mutation, the viruses are viable when the mutation co-exists with a serine at the 
nsP3 amino acid residue 228. Clone pS24 is corrected by by a combination of site- 
directed mutagenesis and replacement of specific pS24 sequences with cDNA 
fragments which do not contain the subject mutations. The resulting cDNA is 
pS55 which contains 5' and 3' untranslated sequences identical to S.A.AR86 

30 genomic RNA, no coding differences with S.A.AR86 genomic RNA, and the 4 
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non-coding changes at nt 215 A-G, nt 3863 C-G, nt 5984 A-G, and nt 9113 T-C. 
Virus derived from pS55 is indistinguishable from native S.A.AR86 in tests of 
virulence, and growth in adult mice and by histopathological analysis of tissues 1 
from infected animals. These results, which are reported in Table 1 below, 
5 indicate that virus derived from clone pS55 accurately reflects native S.A.AR86 
both in terms of coding sequence and in vivo phenotype. 

Table 1 

VIRULENCE IN MICE 



10 



AGE OF MICE 


S.A.AR86 


S55 


S56 2 


S51 3 


4 to 6 Weeks 










Mortality 


100% 


100% 


80% 


20% 


AST' 


6.36 ± 1.39 


6.37 ± 1.4 


9.37 ±1.77 


8.5 ± 0.7 



'Average Survival Time in days. 

2 Virus isogenic with S55 except at nucleotide 6 (C-A). 

15 3 Virus isogenic with S55 except at nucleotide 1648 (OT). This is nsPl amino acid 538 (T 
The isoleucine is the amino acid found in all other Sindbis isolates sequenced to date. 



The 3 mutations or clusters of mutations in clones pS56, pS51, and 
pS57 are placed independently into the pS55 background. Virus derived from each 
of these clones is highly attenuated in adult mice inoculated i.e. In clone pS53, 

20 the mutations at nucleotides 6 and 1672 are combined, and the resulting virus is 
avirulent in the adult mouse model. The mutations in pS61 are present in pS48, 
an intermediate clone constructed during the repair of pS24. The virus from pS48 
produced small plaques on BHK-21 cells. When these mutations are placed in the 
pS55 background, they also gave a highly attenuated phenotype. (S61 , virus derived 

25 from pS61, gave 33.3% mortality (6±4.2)). 

The foregoing is illustrative of the present invention and is not to 
be construed as limiting thereof. The invention is defined by the following claims, 
with equivalents of the claims to be included therein. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(1) APPLICANT: Johnston. Robert E. 

Simpson. Oennis 
Davis. Nancy L. 

(ii) TITLE OF INVENTION: cONA Clone for South African Arbovirus No. 86 

(iii) NUMBER OF SEQUENCES: 1 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Kenneth D. Sibley 

(B) STREET: Post Office Drawer 34009 

(C) CITY: Charlotte 
(0) STATE: NC 

(E) COUNTRY: USA 

(F) ZIP: 28234 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0. Version #1.30 

(vi ) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Sibley. Kenneth D. 

(B) REGISTRATION NUMBER: 31.665 

(C) REFERENCE/DOCKET NUMBER: 5470-118 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (919) 881-3140 

(B) TELEFAX: (919) 881-3175 

(C) TELEX: 575102 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11663 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

ATTGGCGGCG TAGTACACAC TATTGAATCA AACAGCCGAC CAATTGCACT ACCATCACAA 60 

TGGAGAAGCC AGTAGTTAAC GTAGACGTAG ACCCTCAGAG TCCGTTTGTC GTGCAACTGC 120 

AAAAGAGCTT CCCGCAATTT GAGGTAGTAG CACAGCAGGT CACTCCAAAT GACCATGCTA 180 



WO 96/37220 PCT/US96/07457 

-14- 



ATrrr ap a rr 


ATTTTPPP A T 

Al 1 1 ILuLAI 


rrrrrr a pt a 
U 1 CxbLLAb ! A 


AAPTA ATPPA 

AAL 1 AA 1 LbA 


rnnr a rrxr 
bulbbAbbl 1 


ppTArr Ar ao 


240. 


pp a pp atttt 


rr apatappt 
bbAlAIAubl 


acpppatppp 


PTPP.TAPAAT 
LILbl AbAAl 


PTTTTPPP AP 

b 1 1 1 1 LUbAb 


PAPPAPTAPP 


300 


attppp flip 
AUbLbl 1 lb 


ppppatpppt 
LLLLA 1 btb 1 


aptppapaap 
Ab 1 LLAbAAb 


ALLLWaALLb 


PATPATPA AA 

LAIbAlbAAA 


T ATPPP A Pr A 

lAlbLCAbCA 


360 


A A PTP/V/T A 

MCTbbCbbA 


AAA APPATPT 

AAAAbLAIbl 


A AP ATTAP A A 

AAbAI lACAA 


APA APA APTT 

AtAAGAAU 1 


PPATPAPA AP 

bCATGAbAAb 


ATr a Arr Arr 
ATCAAbbACC 


a on 

420 


ta Af»/"» appot 

TCCGGACCGT 


ACTTGATACA 


a a a a a tjtto 

CCGGATGCTG 


A A A A A A A * TA 

AAACGCCATC 


A ATATA ATTA 

ACTCTGCTTC 


A A A A A AA A TA 

CACAACGATG 


480 


TTACCTGCAA 


CACGCGTGCC 


GAGTACTCCG 


TP»Tpr»pp* 

TCATGCAGGA 


A AT AT AAA TA 

CGTGTACATC 


A A AAATAAAA 

AACGCTCCCG 


540 


A a A A^ A 1 '1 1 A 

GAACTATTTA 


a a a a a A r\ i 

CCACCAGGCT 


A A A A 

ATGAAAGGCG 


TAAAA A A A AT 

TGCGGACCCT 


AT A ATAA A t 1 

GTACTGGATT 


AAA^'g AA A A 

GGCTTCGACA 


600 


A A 1 AAA A /» 1' 1 

CCACCCAGTT 


CATGTTCTCG 


AWT A TAAA A A 

GCTATGGCAG 


ATI A AT" A AAA 

GTTCGTACCC 


TA A A T AAA A A 

TGCATACAAC 


A A A A A ATAAA 

ACCAACTGGG 


660 


AAA A 0/> A A A A 

CCGACGAAAA 


a nwnTM A A 

AGTCCTTGAA 


PPPPPTA A/* A 

GCGCGTAACA 


TAAA A ATT A TP 

TCGGACTCTG 


O IMA AAA A A A 

CAGCACAAAG 


AT A A ATA AAA 

CTGAGTGAAG 


720 


/-» /-» a AA i/Hi A A 

GCAGGACAGG 


a a APTTPT/T 

AAAGTTGTCG 


ATA A ta A OP A 

ATAATGAGGA 


A A « A A A A PTT 

AGAAGGAGTT 


a «APrrropp 

GAAGCCCGGG 


T/*» A A AAA 1 1 f 

TCACGGGTTT 


780 


a | 1 1 ata a^t 

ATTTCTCCGT 


TA A jt^pa a A 

TGGATCGACA 


a i i i a a a a a a 

CTTTACCCAG 


A APAAAAAAA 

AACACAGAGC 


/* A f> PTTAA A A 

CAGCTTGCAG 


A A ATA A A A TA 

AGCTGGCATC 


840 


TTCCATCGGT 


GTTCCACTTG 


AAA An AAA A A 

AAAGGAAAGC 


A ATA AT A A A A 

AGTCGTACAC 


1 1 AAAAATAT 

TTGCCGCTGT 


A A T AAA ATAA 

GATACAGTGG 


900 


TGAGCTGCGA 


AGGCTACGTA 


GTGAAGAAAA 


TCACCATCAG 


TCCCGGGATC 


ACGGGAGAAA 


960 


CCGTGGGATA 


CGCGGTTACA 


AACAATAGCG 


AGGGCTTCTT 


GCTATGCAAA 


GTTACCGATA 


1020 


CAGTAAAAGG 


AGAACGGGTA 


TCGHCCCCG 


TGTGCACGTA 


TATCCCGGCC 


ACCATATGCG 


1080 


ATCAGATGAC 


CGGCATAATG 


GCCACGGATA 


TCTCACCTGA 


CGATGCACAA 


AAACTTCTGG 


1140 


TTGGGCTCAA 


CCAGCGAATC 


GTCATTAACG 


GTAAGACTAA 


CAGGAACACC 


AATACCATGC 


1200 


AAAATTACCT 


TCTGCCAATC 


ATTGCACMG 


GGTTCAGCAA 


ATGGGCCAAG 


GAGCGCAAAG 


1260 


AAGATCTTGA 


CAATGAAAAA 


ATGCTGGGCA 


CCAGAGAGCG 


CAAGCTTACA 


TATGGCTGCT 


1320 


TGTGGGCGTT 


TCGCACTAAG 


AAAGTGCACT 


CGTTCTATCG 


CCCACCTGGA 


ACGCAGACCA 


1380 


TCGTAAAAGT 


CCCAGCCTCT 


TTTAGCGCTT 


TCCCCATGTC 


ATCCGTATGG 


ACTACCTCJT 


1440 


TGCCCATGTC 


GCTGAGGCAG 


AAGATGAAAT 


TGGCATTACA 


A A A A A A A A A A 

ACCAAAGAAG 


A A A A A A A A A A 

GAGGAAAAAC 


1500 


TGCTGCAAGT 


CCCGGAGGAA 


TTAGTTATGG 


AGGCCAAGGC 


TGCTTTCGAG 


f% A TAATA A A A 

GATGGTCAGG 


1560 


a a a t a a a a 

AGGAATCCAG 


a aa aa a a a a a 

AGCGGAGAAG 


AT«rtA AAA A A 

CTCCGAGAAG 


A A ATAA A A A A 

CACTCCCACC 


A TT A ATAA A ■ 

ATTAGTGGCA 


A A A A A A PPT A 

GACAAAGGT A 


1620 


T/^O A AAA A /"> A 

TCGAGGCAGC 


ta a aa a a m r 

TGCGGAAGTT 


AT AT" A A A AAA 

GTCTGCGAAG 


T A A A A A A A AT 

TGGAGGGGCT 


AA A AAA AA « A 

CCAGGCGGAC 


A APAAAAAA A 

ACCGGAGCAG 


1680 


CACTCGTCGA 


AACCCCGCGC 


PPTPATPT* A 

GGTCATGTAA 


A A ATA ITAPP 

GGATAATACC 


X/* A A P/* A A AT* 

TCAAGCAAAT 


GACCGTATGA 


1740 


TAAA AAA A t A 

TCGGACAGTA 


TATCGTTGTC 


TA/^APP A TAT 

TCGCCGATCT 


AT AT A ATA A A 

CTGTGCTGAA 


AAA A A AT AAA 

GAACGCTAAA 


CTCGCACCAG 


1800 


CACACCCGCT 


AGCAGACCAG 


< | ■ f A A A M VA A 

GTTAAGATCA 


T A A AAA A ATA 

TAACGCACTC 


AAA A A A A TA A 

CGGAAGATCA 


AA A A A AT ATA 

GGAAGGTATG 


1860 


CAGTCGAACC 


ATACGACGCT 


AAA A^F * A-T* A A 

AAAGTACTGA 


TAAA A AA A AA 

TGCCAGCAGG 


A A A T A A A A '1 A 

MGTGCCGTA 


A a A TAAA A A A 

CCAT6GCCAG 


1920 


AATTCTTAGC 


ACTGAGTGAG 


AGCGCCACGC 


TTGTGTACAA 


CGAAAGAGAG 


TTTGTGAACC 


1980 


GCMGCTGTA 


CCATATTGCC 


ATGCACGGTC 


CCGCTAAGAA 


TACAGAAGAG 


GAGCAGTACA 


2040 


AGGTTACAAA 


GGCAGAGCTC 


GCAGAAACAG 


AGTACGTGTT 


TGACGTGGAC 


AAGAAGCGAT . 


2100 


GCGTTAAGAA 


GGAAGAAGCC 


TCAGGACTTG 


TCCTTTCGGG 


AGAACTGACC 


AACCCGCCCT 


2160 


ATCACGAACT 


AGCTCTTGAG 


GGACTGAAGA 


CTCGACCCGC 


GGTCCCGTAC 


AAGGTTGAAA 


2220 
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CAATAGGAGT 


GATAGGCACA 


CCAGGATCGG 


GCAAGTCAGC 


TATCATCAAG 


TCAACTGTCA 


2280 


ppppapptpa 
bbbLALb 1 bA 


t/'i ipttapp 
Ibl Ibl IACC 


AbCbbAAAGA 


AA^AAtA a-v- a 

AAGAAAACTG 


f*f \ \ M A A 'T' T" 

CCGCGAAATT 


GAGGCCGACG 


2340 


1 b t 1 ALbbu I 


papppppatp 
bAbbbuLAlb 


PAP ATP APPT 

CAGATCACGT 


CGAAGACAGT 


A A a T A. A A y T 

GGATTCGGTT 


ATGCTCAACG 


2400 


gatp^papaa 

UA 1 UblALAA 


ap.ppp.tap a a 

AbLLb J MbAA 


PTPPTPTATP 
ulbbiblAlb 


TTP APP A APP 

1 IbACbAAbC 


PTTPP/WTO/* 

GTTCCGGTGC 


AA/»A n A^AA/> 

CACGCAGGAG 


2460 


paptapttpp 


PTTPATTPPA 
b 1 1 uA 1 1 ULA 


ATPCTPAPAP 
A 1 Lb 1 LAbAL 


PPPPTA APA A 

LLbbTAAbAA 


PPT APT APT A 

GbTAGTACTA 


T/*^AO a ft a a a 

TGCGGAGACC 


2520 


fTAAGCAATG 


pggattpttp 


aapatpatpp 

HMLH 1 UA 1 Uu 


AAPTAAAPPT 
AAL 1 AAAbb 1 


APATTTPAAP 
ALAI 1 ILAAL 


PAPPPTPA A A 

LALLLTbAAA 


iron 

2580 


AAGAPATATG 


TAPPAAP.APA 


i rn APAAPT 
1 HI ALAAb 1 


TTATPTPPPP 
i 1 AIU LLLb 


APPTTPPAPA 

Abb I IbbAbA 


PAPPP APTP A 

CAGCCAGTCA 


2640 


pggptattpt 


ATPPAPAnp 


PATTAPPATP 
LAI lAbbAlb 


PAAAAATPAA 

bAAAAAlbAA 


A APPAPA A AP 

AAlLALAAAl 


/^rnrf^r* a a p a 

CCGTGCAAGA 


2700 


APAAPATPPA 
AbAAbA 1 buA 


AATPPAPATT 
AAlLbALAl 1 


APAPPPPPPA 
ALAbbbbLLA 


PPA APPPPA A 

LbAAbCCGAA 


PPP A rvr*/^ a /■» 

GCCAGGGGAC 


ATCATCCTGA 


2760 


p atp t t f ppp 

CATbTTTCCG 


PPPPTPPPTT 

CGGGTGGGTT 


A A O/* A A rTr>0 

AAGCAACTGC 


A A tT A Al a 

AAATCGACTA 


TCCCGGACAT 


gaggtmtga 


2820 


CAGCCGCGGC 


CTCACAAGGG 


CTAACCAGAA 


AAGGAGTATA 


TGCCGTCCGG 


CAAAAAGTCA 


2880 


ATGAAAACCC 


GCTGTACGCG 


ATCACATCAG 


AGCATGTGAA 


CGTGTTGCTC 


ACCCGCACTG 


2940 


AGGACAGGCT 


AGTATGGAAA 


ACTTTACAGG 


GCGACCCATG 


GATTAAGCAG 


CTCACTAACG 


3000 


TACCTAAAGG 


AAATTTTCAG 


GCCACCATCG 


AGGACTGGGA 


AGCTGAACAC 


AAGGGAATAA 


3060 


TTGCTGCGAT 


AAA A** A T ^ *S 

AAACAGTCCC 


GCTCCCCGTA 


CCAATCCGTT 


CAGCTGCAAG 


ACTAACGTTT 


3120 


GCTGGGCGAA 


A ^ ^ A a a 

AGCACTGGAA 


CCGATACTGG 


CCACGGCCGG 


TATCGTACTT 


ACCGGTTGCC 


3180 


AGTGGAGCGA 


GCTGTTCCCA 


CAGTTTGCGG 


ATGACAAACC 


ACACTCGGCC 


ATCTACGCCT 


3240 


TAGACGTAAT 


TTGCATTAAG 


till ICGGCA 


TGGACTTGAC 


MGCGGGCTG 


TTTTCCAAAC 


3300 


AGAGCATCCC 


GTTAACGTAC 


CATCCTGCCG 


ACTCAGCGAG 


GCCAGTAGCT 


CATTGGGACA 


3360 


ACAGCCCAGG 


AACACGCAAG 


TATGGGTACG 


ATCACGCCGT 


TGCCGCCGAA 


CTCTCCCGTA 


3420 


GATTTCCGGT 


GTTCCAGCTA 


GCTGGGAAAG 


GCACACAGCT 


TGATTTGCAG 


ACGGGCAGAA 


3480 


CTAGAGTTAT 


CTCTGCACAG 


CATAACTTGG 


TCCCAGTGAA 


CCGCAATCTC 


CCTCACGCCT 


3540 


TAGTCCCCGA 


GCACAAGGAG 


AAACAACCCG 


GCCCGGTCGA 


AAAATTCTTG 


AGCCAGTTCA 


3600 


AACACCACTC 


CGTACTTGTG 


A ^T^'ff X% A S\ A A A 

ATCTCAGAGA 


AAAAAATTGA 


AGCTCCCCAC 


AAGAGAATCG 


3660 


AATGGATCGC 


CCCGATTGGC 


ATAGCCGGCG 


CAGATAAGAA 


CTACAACCTG 


GCTTTCGGGT 


3720 


TTCCGCCGCA 


a a a a MA/vri « 

GGCACGGTAC 


GACCTGGTGT 


*TfS A T" A A ^P A — T- 

TCATCAATAT 


TGGAACTAAA 


TACAGAAACC 


3780 


ATCAUTTCA 


ACAGTGCGAA 


GACCACGCGG 


a a a a a y r a a a 

CGACCTTGAA 


A A >|l| A, a, 

AACCCTTTCG 


A A | f A A AAA> A 

CGTTCGGCCC 


3840 


TGAACTGCCT 


taa t*r*r*f*r*F* a 

TAACCCCGGA 


GGCACCCTCG 


f/^A«T-/N a a ATA 

TGGTGAAGTC 


A^- « A A A-» f S A 

CTACGGTTAC 


a a a a a a a a a a 

GCCGACCGCA 


3900 


ATAGTGAGGA 


CGTAGTCACC 


a AT"T» A A A a 

GCTCTTGCCA 


Ai a a a a TFT 't I 

GAAAATTTGT 


CAGAGTGTCT 


* AAA A A--fc A 

GCAGCGAGGC 


3960 


CAGAGTGC6T 


CTCAAGCAAT 


ACAGAAATGT 


ACCTGATTTT 


CCGACAACTA 


GACAACAGCC 


4020 


GCACACGACA 


ATTCACCCCG 


CATCATTTGA 


ATTGTGTGAT 


TTCGTCCGTG 


TACGAGGGTA 


4080 


CAAGAGACGG 


AGTTGGAGCC 


GCACCGTCGT 


ACCGTACTAA 


MGGGAGAAC 


ATTGCTGATT 


4140 


GTCAAGAGGA 


AGCAGTTGTC 


AATGGAGCCA 


ATCCACTGGG 


CAGACCAGGA 


GAAGGAGTCT 


4200 


GCCGTGCCAT 


CTATAAACGT 


TGGCCGAACA 


GTTTCACCGA 


TTCAGCCACA 


GAGACAGGTA 


4260 
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CCGCAAAACT GACTGTGTGC CAAGGAAAGA AAGTGATCCA CGCGGTTGGC CCTGATTTCC 4320 

GGAAACACCC AGAGGCAGAA GCCCTGAAAT TGCTGCAAAA CGCCTACCAT GCAGTGGCAG 4380 

ACTTAGTAAA TGAACATAAT ATCAAGTCTG TCGCCATCCC ACTGCTATCT ACAGGCATTT 4440 

ACGCAGCCGG AAAAGACCGC CTTGAGGTAT CACTTAACTG CTTGACAACC GCGCTAGACA 4500 

GAACTGATGC G6ACGTAACC ATCTACTGCC TGGATAAGAA GTGGAAGGAA AGAATCGACG 4560 

CGGTGCTCCA ACTTAAGGAG TCTGTAACTG AGCTGAAGGA TGAGGATATG GAGATCGACG 4620 

ACGAGTTAGT ATGGATCCAT CCGGACAGTT GCCTGAAGGG AAGAAAGGGA TTCAGTACTA 4680 

CAAAAGGAAA GTTGTATTCG TACTTTGAAG GCACCAAATT CCATCAAGCA GCAAAAGATA 4740 

TGGCGGAGAT AAAGGTCCTG TTCCCAAATG ACCAGGAAAG CAACGAACAA CTGTGTGCCT 4800 

ACATATTGGG GGAGACCATG GAAGCAATCC GCGAAAAATG CCCGGTCGAC CACAACCCGT 4860 

CGTCTAGCCC GCCAAAAACG CTGCCGTGCC TCTGTATGTA TGCCATGACG CCAGAAAGGG 4920 

TCCACAGACT CAGAAGCAAT AACGTCAAAG AAGTTACAGT ATGCTCCTCC ACCCCCCTTC 4980 

CAAAGTACAA AATCAAGAAT GTTCAGAAGG TTCAGTGCAC AAAAGTAGTC CTGTTTAACC 5040 

CGCATACCCC CGCATTCGTT CCCGCCCGTA AGTACATAGA AGCACCAGAA CAGCCTGCAG 5100 

CTCCGCCTGC ACAGGCCGAG GAGGCCCCCG GAGTTGTAGC GACACCAACA CCACCTGCAG 5160 

CTGATAACAC CTCGCTTGAT GTCACGGACA TCTCACTGGA CATGGAAGAC AGTAGCGMG 5220 

GCTCACTCTT TTCGAGCTTT AGCGGATCGG ACAACTACCG AAGGCAGGTG GTGGTGGCTG 5280 

ACGTCCATGC CGTCCAAGAG CCTGCCCCTG TTCCACCGCC AAGGCTAAAG AAGATGGCCC 5340 

GCCTGGCAGC GGCAAGAATG CAGGAAGAGC CAAaCCACC GGCAAGCACC AGCTCTGCGG 5400 

ACGAGTCCCT TCACCTTTCT TTTGATGGGG TATCTATATC CTTCGGATCC CTTTTCGACG 5460 

GAGAGATGGC CCGCTTGGCA GCGGCACMC CCCCGGCAAG TACATGCCCT ACGGATGTGC 5520 

CTATGTCTTT CGGATCGTTT TCCGACGGAG AGATTGAGGA GTTGAGCCGC AGAGTAACCG 5580 

AGTCGGAGCC CGTCCTGTTT GGGTCATTTG AACCGGGCGA AGTGAACTCA ATTATATCGT 5640 

CCCGATCAGC CGTATCTTTT CCACCACGCA AGCAGAGACG TAGACGCAGG AGCAGGAGGA 5700 

CCGAATACTG TCTAACCGGG GTAGGTGGGT ACATATTTTC GACGGACACA GGCCCTGGGC 5760 

ACTTGCAAAA GAAGTCCGTT CTGCAGAACC AGCTTACAGA ACCGACCTTG GAGCGCAATG 5820 

TTCTGGAAAG AATCTACGCC CCGGTGCTCG ACACGTCGAA AGAGGAACAG CTCAAACTCA 5880 
GGTACCAGAT GATGCCCACC GAAGCCAACA AAAGCAGGTA CCAGTCTCGA AAAGTAGAAA 5940 
ACCAGAAAGC CATAACCACT GAGCGACTGC TTTCAGGGCT ACGACTGTAT AACTCTGCCA 6000 
CAGATCAGCC AGAATGCTAT AAGATCACCT ACCCGAAACC ATCGTATTCC AGCAGTGTAC 6060 
CAGCGAACTA CTa&ACCCA AAGTTTGCTG TAGCTGTTTG TAACAACTAT CTGCATGAGA 6120 
ATTACCCGAC GGTAGCATCT TATCAGATCA CCGACGAGTA CGATGCTTAC TTGGATATGG 6180 
TAGACGGGAC AGTCGCTT6C CTAGATACTG CAACTTTTTG CCCCGCCAAG CTTAGAAGTT 6240 
ACCCGAAAAG ACACGAGTAT AGAGCCCCAA ACATCCGCAG TGCGGTTCCA TCAGCGATGC 6300 
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GPGAPPGPTT 


APPTATGPP.P, 
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P.ATPPAPPPP 
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P.APTTAPTPP 

uAbl lAblbL 


O/Z0 


GCAGGCTTAC 


AGCCGTTTTG 


CTACCCAACA 


TTCACACGCT 


PTTTPAPATR 


TPPPPPPAPP 
1 LUULuUAbU 


D/OU 


ACTTTGATGP 


AATPATAGPA 


GAAPAPTTPA 
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P.APAPPPATA 
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Oo40 
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LbALAAAAbL 
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PTATPPPPTT 
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A APPPPPPTP 
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ATP A TPTTPP 

ATGATCTTGG 


6900 


AAGACCTGGG 


TPTPP A pp A A 

TGTGGACCAA 


P A nf A S*^*S*M 

CCACTACTCG 


ACTTGATCGA 


GTGCGCCTTT 


GGAGAAATAT 


6960 


CATCCACCCA 


TCTGCCCACG 


GGTACCCGTT 


TCAAATTCGG 


GGCGATGATG 


AAATCCGGAA 


7020 


TGTTCCTCAC 


GCTCTTTGTC 


AACACAGTTC 


TGAATGTCGT 


TATCGCCAGC 


AGAGTATTGG 


7080 


a a pppp pt 

AGGAGCGGCT 


t* a a a a r^Trr 

TAAAACGTCC 


AAA T /*» T O A A 

AAATGTGCAG 


CATTTATCGG 


CGACGACAAC 


ATTATACACG 


7140 


P A pt A PT A TP 

bAbiAbTAlL 
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A TPPPTP A O A 
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GGTGTGCCAC 


PTPP/^T^ A A 
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A ^AA\ A AATT A 

ATGGAGGTTA 


7200 


a p a tp a nr a 

AGATCATTGA 


CGCAGTCATC 


GGCGAGAGAC 


P A PPTT A PTT 

CACCTTACTT 


CTGCGGTGGA 


TTCATCTTGC 


7260 


A A P A "rrr^r»T 

AAGATTCGGT 


TACCTCCACA 


GCGTGTCGCG 


TGGCGGACCC 


^ f * /*\ A A A A 

CTTGAAAAGG 


CTGTTTAAGT 


7320 
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& A A A> O A A A 

AAGACGAAGA 
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GCTCTGCTAG 


7380 


ATGAAACAAA 


GGCGTGGTTT 


AGAGTAGGTA 


TAACAGACAC 


CTTAGCAGTG 


GCCGTGGCAA 


7440 


T nnf% ¥ A A 

CTCGGTATGA 


GGTAGACAAC 


ATCACACCTG 


TCCTGCTGGC 


ATTGAGAACT 


TTTGCCCAGA * 


7500 
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ATTTCAAGCC 
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ATCAGAGGGG 


A A A T A A A OP A 

AAATAAAGCA 


TPTPT A PPPT 

TCTCTACGGT 


GGTCCTAAAT 


7560 


AGTCAbLATA 


PTAPATTTPA 

GTACATTTCA 


TPTp A pT A AT 

TlTGACTAAT 


APPAPA AO AO 

ACCACAACAC 


PAPPAPPATP 

CACCACCATG 


A AT AP APP AT 

AATAGAGGAT 


7620 


TCTTTAACAT 


GCTCGGCCGC 


O /V* O O PTTP P 

CGCCCCTTCC 


/■» a ppppor a p 

CAGCCCCCAC 


TGCCATGTGG 


AGGCCGCGGA 
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nil rip A r\f~-r*. a 

GAAGGAGGCA 
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GGCGGCCCCG 


ATGCCTGCCC 


PP A A TOr»Pf**T 

GCAATGGGCT 


GGCTTCCCAA 


ATCCAGCaAAC 


7740 
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PT* A PTP A TTP 
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PAPA PPP A A P 
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TT A P A P f«^*P A A 
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7800 


CACGCCCGCC 


GCCGCGCCAG 


A AO A a c> /■> 

AAGAAGCAGG 
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Ann a AAA A A /-N 

CCGAAGAAAC 
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TGTTCGACGT 
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GACGGA6ATG 


7980 


TCATCGGGCA 


CGCACTGGCC 


ATW^AAfV^AA 

AT6GAA6GAA 


AGGTAATGAA 


ACCACTCCAC 


WrAAArPAA 

GTGAAAGGAA 


8040 


CTATTGACCA 


CCCTGTGCTA 


TCAAAGCTCA 


AATTCACCAA 


GTCGTCAGCA 


TACGACATGG 


8100 


AGTTCGCACA 


GTTGCCGGTC 


AACATGAGAA 


GTGAGGCGTT 


CACCTACACC 


AGTGAACACC 


8160 


CTGAAGGGTT 


CTACAACTGG 


CACCACGGAG 


CGGTGCAGTA 


TAGTGGAGGC 


AGATTTACCA 


8220 


TCCCCCGCGG 


AGTAGgAGGC 


AGAGGAGACA 


GTGGTCGTCC 


GATTATGGAT 


AACTCAGGCC 
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GGGTTGTCGC 


GATAGTCCTC 


GGAGGGGCTG 


ATGAGGGAAC 


AAGAACCGCC 


CTTTCGGTCG 


8340 
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TCACCTGGAA TAGCAAAGGG AAGACAATCA AGACAACCCC GGAAGGGACA 6AAGAGTGGT 8400 

CTGCTGCACC ACTGGTCACG GCCATGTGCT TGCTTGGAAA CGTGAGCTTC CCATGCMTC 8460 

GCCCGCCCAC ATGCTACACC CGCGAACCAT CCAGAGCTCT CGACATCCTC GAAGAGAACG 8520 

TGAACCACGA GGCCTAC6AC ACCCTGCTCA ACGCCATATT GCGGTGCGGA TCGTCCGGCA 8580 

GAAGTAAAAG AAGCGTCACT GACGACTTTA CCTTGACCAG CCCGTACTTG GGCACATGCT 8540 

CGTACTGTCA CCATACTGAA CCGTGCTTTA GCCCGATTAA GATCGAGCAG GTCTGGGATG 8700 

AAGCGGACGA CAACACCATA CGCATACAGA CTTCCGCCCA GTTTGGATAC GACCAAAGCG 8760 

GAGCAGCAAG CTCAAATAAG TACCGCTACA TGTCGCTCGA GCAGGATCAT ACTGTCAAAG 8820 

AAGGCACCAT GGATGACATC MGATCAGCA CCTCAGGACC GTGTAGAAGG CTTAGCTACA 8880 

AAGGATACTT TCTCCTCGCG AAGTGTCCTC CAGGGGACAG CGTAACGGTT AGCATAGCGA 8940 

GTAGCAACTC AGCAACGTCA TGCACAATGG CCCGCAAGAT AAAACCAAAA TTCGTGGGAC 9000 

GGGAAAAATA TGACCTACCT CCCGTTCACG GTAAGAAGAT TCCTTGCACA GTGTACGACC 9060 

GTCTGAAAGA AACAACCGCC GGCTACATCA CTATGCACAG GCCGGGACCG CATGCCTATA 9120 

CATCCTATCT GGAGGAATCA TCAGGGAAAG TTTACGCGAA GCCACCATCC GGGAAGAACA 9180 

TTACGTACGA GTGCAAGTGC GGCGATTACA AGACCGGAAC CGTTACGACC CGTACCGAAA 9240 

TCACGGGCTG CACCGCCATC AAGCAGTGCG TCGCCTATAA GAGCGACCAA ACGAAGTGGG 9300 

TCTTCAACTC GCCGGACTCG ATCAGACACG CCGACCACAC GGCCCAAGGG AAATTGCATT 9360 

TGCCTTTCAA GCTGATCCCG AGTACCTGCA TGGTCCCTGT TGCCCACGCG CCGAACGTAG 9420 

TACACGGCTT TAAACACATC AGCCTCCAAT TAGACACAGA CCATCTGACA TTGCTCACCA 9480 

CCAGGAGACT AGGGGCAAAC CCGGAACCAA CCACTGAATG GATCATCGGA AACACGGTTA 9540 

GAAACTTCAC CGTCGACCGA GATGGCCTGG AATACATATG GGGCAATCAC GAACCAGTAA 9600 

GGGTCTATGC CCAAGAGTCT GCACCAGGAG ACCCTCACGG ATGGCCACAC GAAATAGTAC 9660 

AGCATTACTA TCATCGCCAT CCTGTGTACA CCATCTTAGC CGTCGCATCA GCTGCTGTGG 9720 

CGATGATGAT TGGCGTAACT GTTGCAGCAT TATGTGCCTG TAAAGCGCGC CGTGAGTGCC • 9780 

TGACGCCATA TGCCCTGGCC CCAAATGCCG TGATTCCAAC TTCGCTGGCA CTTTTGTGCT 9840 

GTGTTAGGTC .GGCTAATGCT GAAACATTCA CCGAGACCAT GAGnACTTA TGGTCGAACA 9900 

GCCAGCCGTT CTTCTGGGTC CAGCTGTGTA TACCTCTGGC CGCTGTCGTC GTTCTAATGC 9960 

GCTGTTGCTC ATGCTGCCTG CCTTTTTTAG TGGTTGCCGG CGCCTACCTG GCGAAGGTAG 10020 

ACGCCTACGA ACATGCGACC ACTGTTCCAA ATGTGCCACA GATACCGTAT AAGGCACTTG 10080 

TTGAAAGGGC AGGGTACGCC CCGCTCAATT TGGAGATTAC TGTCATGTCC TCGGAGGTTT 10140 

TGCCTTCCAC CAACCAAGAG TACATTACCT GCAAATTCAC CACTGTGGTC CCCTCCCCTA 10200 

AAGTCAGATG CTGCGGCTCC TTGGAATGTC AGCCCGCCGC TCACGCAGAC TATACCTGCA 10260 

AGGTCTTTGG AGGGGTGTAC CCCTTCATGT GGGGAGGAGC ACAATGTTTT TGCGACAGTG 10320 

AGAACAGCCA GATGAGTGAG GCGTACGTCG AATTGTCAGT AGATTGCGCG ACTGACCACG 10380 
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PGPAGGPGAT 


TAAGGTGPAT 


APTGPPGPGA TGAAAPTAPP. APTGPP.TATA PTPT APPPP A 
MU I uUUUUutt 1 UMMMu 1 Muu ML 1 uUu 1 M 1 M b 1 b 1 AUbbuA 
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TTTCCTAGAT 
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CTATTGACAT 
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PAPAPTTPAT 
bALAb 1 ILAI 
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LbAuLLLALA 


ppppa apttp 
bbLbAAL 1 1 L 
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11460 


ACCAAAACTC 


GACGTATTTC 


CGAGGAAGCG CAGTGCATAA TGCTGCGCAG TGTTGCCAAA 


11520 


TAATCACTAT 


ATTAACCATT 


TATTCAGCGG ACGCCAAAAC TCAATGTATT TCTGAGGAAG 


11580 


CATGGTGCAT 


AATGCCATGC 


AGCGTCTGCA TAACTTTTTA TTATTTCTTT TATTAATCAA 


11640 


CAAAATTTTG 


TTTTTAACAT 


TTC 


11663 
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That Which Is Claimed Is: 

1. A recombinant DNA comprising a cDNA coding for an 
infectious South African Arbovirus No. 86 (S. A. AR86) virus RNA transcript, and 
a heterologous promoter positioned upstream from said cDNA and operatively 

5 associated therewith. 

2. The recombinant DNA according to Claim 1, wherein said 
cDNA is selected from the group consisting of (i) cDNA having the sequence 
given herein as SEQ ID NO.: 1, (/i) cDNA having the same protein coding 
sequence as the cDNA given herein as SEQ ID NO.: 1, and (Hi) cDNA according 

10 to (0 or above, and further comprising at least one attenuating mutation in said 
cDNA. 

3. The recombinant DNA according to Claim 1, wherein said 
cDNA has the sequence given herein as SEQ ID NO.: 1. 

4. The recombinant DNA according to Claim 1, further 
15 comprising at least one attenuating mutation in said cDNA clone. 

5. The recombinant DNA according to Claim 1, further 
comprising at least two attenuating mutations in said cDNA clone. 

6. The recombinant DNA according to Claim 5, wherein each 
of said attenuating mutations are in the region selected from the group consisting 

20 of the nsPl coding region, E2 coding region, and nsP2 coding region. 

7. The recombinant DNA according to Claim 1, further 
comprising at least one attenuating mutation selected from the group consisting of 
codons at nsPl amino acid position 538 which specify an attenuating amino acid, 
codons at E2 amino acid position 304 which specify an attenuating amino acid, 

25 codons at E2 amino acid position 314 which specify an attenuating amino acid, 
codons at E2 amino acid position 372 which specify an attenuating amino acid, 
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codons at E2 amino acid position 376 which specify an attenuating amino acid, 
codons at nsP2 amino acid position 96 which specify an attenuating amino acid, 
codons at nsP2 amino acid position 372 which specify an attenuating amino acid, 
codons at nsP2 amino acid position 529 which specify an attenuating amino acid, 
5 codons at nsP2 amino acid position 571 which specify an attenuating amino acid, 
codons at nsP2 amino acid position 682 which specify an attenuating amino acid, 
codons at nsP2 amino acid position 804 which specify an attenuating amino acid, 
and codons at nsP3 amino acid position 22 which specify an attenuating amino 
acid. 

10 8. The recombinant DNA according to Claim 7, wherein said 

attenuating mutation comprises a substitution mutation. 

9. The recombinant DNA according to Claim 8, wherein said 
substitution mutation codes for isoleucine at nsPl amino acid 538. 

10. The recombinant DNA according to Claim 8, wherein said 
15 substitution mutation codes for threonine at E2 amino acid 304. 

11. The recombinant DNA according to Claim 8, wherein said 
substitution mutation codes for lysine at E2 amino acid 314. 

12. The recombinant DNA according to Claim 8, wherein said 
substitution mutation codes for valine at E2 amino acid 372. 

20 13. The recombinant DNA according to Claim 8, wherein said 

substitution mutation codes for alanine at E2 amino acid 376. 

14. The recombinant DNA according to Claim 8, wherein said 
substitution mutation codes for glycine at nsP2 amino acid 96. 
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15. The recombinant DNA according to Claim 8, wherein said 
substitution mutation codes for valine at nsP2 amino acid 372. 



16. The recombinant DNA according to Claim 8, wherein said 
substitution mutation codes for leucine at nsP2 amino acid 529. 

5 17. The recombinant DNA according to Claim 8, wherein said 

substitution mutation codes for asparagine at nsP2 amino acid 571. 



18. The recombinant DNA according to Claim 8, wherein said 
substitution mutation codes for arginine at nsP2 amino acid 682. 

19. The recombinant DNA according to Claim 8, wherein said 
10 substitution mutation codes for arginine at nsP2 amino acid 804. 

20. The recombinant DNA according to Claim 8, wherein said 
substitution mutation codes for arginine at nsP3 amino acid 22. 

21. The recombinant DNA according to Claim 1 further 
comprising at least one silent mutation. 

15 22. The recombinant DNA according to Claim 21 , wherein said 

silent mutation is located at a position selected from the group consisting of 
nucleotide 215, nucleotide 3863, nucleotide 5984 and nucleotide 9113. 



23. The recombinant DNA according to Claim 1, wherein not 
more than eight nucleotides are positioned between said promoter and said cDNA 

20 clone. 

24. The recombinant DNA according to Claim 23, wherein said 
promoter is selected from the group consisting of T3 promoters, T7 promoters, 
and SP6 promoters. 
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25. The recombinant DNA according to Claim 1, wherein said 
recombinant DNA comprises a plasmid, and wherein said recombinant DNA 
further comprises a unique restriction site positioned downstream from said cDNA 
clone. 

5 26. An infectious RNA transcript encoded by a cDNA according 

to Claim 1. 

27. Infectious attenuated viral particles containing an RNA 
transcript of Claim 26. 

28. A pharmaceutical composition comprising an effective 
10 immunogenic amount of an infectious attenuated virus according to Claim 27 in 

combination with a pharmaceutical^ acceptable carrier. 
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